FEBS Letters 583 (2009) 549-555 



Individual and common inhibitors of coronavirus and picornavirus main proteases 

Chih-Jung Kuo a,b , Hun-Ge Liu a , Yueh-Kuei Lo a , Churl-Min Seong c , Kee-In Lee c , 

Young-Sik Jung c *, Po-Huang Liang a b ’* 

a Institute of Biological Chemistry, Academia Sinica, Taipei 11529, Taiwan, ROC 
b Taiwan International Graduate Program, Academia Sinica, Taipei 11529, Taiwan 
c Korea Research Institute of Chemical Technology, Daejeon 305-606, Republic of Korea 


ARTICLE INFO 


ABSTRACT 


Article history: 

Received 5 November 2008 
Revised 17 December 2008 
Accepted 23 December 2008 
Available online 21 January 2009 

Edited by Hans-Dieter Klenk 


Keywords: 

Coronavirus 
Picornavirus 
3C protease 
Fluorescence assay 
High throughput screening 
Computer modeling 


Picornaviruses (PV) and coronaviruses (CoV) are positive-stranded RNA viruses which infect mil¬ 
lions of people worldwide each year, resulting in a wide range of clinical outcomes. As reported 
in this study, using high throughput screening against —6800 small molecules, we have identified 
several novel inhibitors of SARS-CoV 3CL pro with IC 50 of low pM. Interestingly, one of them equally 
inhibited both 3C pro and 3CL pro from PV and CoV, respectively. Using computer modeling, the struc¬ 
tural features of these compounds as individual and common protease inhibitors were elucidated to 
enhance our knowledge for developing anti-viral agents against PV and CoV. 

© 2009 Federation of European Biochemical Societies. Published by Elsevier B.V. All rights reserved. 


1. Introduction 

Picornaviruses (PV) are small nonenveloped RNA viruses with a 
single strand of genomic RNA of 7500-8000 nucleotides [1]. The 
members of PV include rhinoviruses (RV), enteroviruses (EV), cox¬ 
sackieviruses (CV), polioviruses, echoviruses, encephalomyocard- 
itis viruses, meningitis virus, foot and mouth viruses, hepatitis A 
virus, and so on. Among them, RV is the major cause of the com¬ 
mon cold, whereas EV and CV infection can cause hand, foot, and 
mouth diseases in human and animals. In severe cases, EV can 
damage the central nervous systems leading to viral meningitis, 
encephalitis, and severe myocarditis, as well as fatal pulmonary 
edema [2-5]. CV strain B is a major human pathogen that causes 
meningitis and mycocarditis leading to heart failure in young 
adults and congestive heart failure [6]. In these PV, a chymotryp- 
sin-like protease (named 3C pro ) is required to process polyproteins 
into mature proteins for viral replication, which represents a prom¬ 
ising anti-viral drug target [7]. 
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On the other hand, coronaviruses (CoV) are the positive- 
stranded RNA viruses with larger genome of 27-32 kb, which 
typically cause respiratory and enteric diseases, pneumonia, 
exacerbation of asthma, neurological symptoms, and myocarditis 
in humans and domestic animals. An outbreak of severe acute 
respiratory syndrome (SARS), caused by a novel human CoV, was 
spread from China to 29 countries in 2003, infecting a total of 
—8000 people and killing —800 patients [8]. SARS-CoV contains a 
3C-like protease (3CL pro ) analogous to the 3C pro of PV, responsible 
for processing two overlapping polyproteins, ppla (486 kDa) and 
pplab (790 kDa). Other members of human CoV including CoV- 
229E, CoV-OC43, CoV-HKUl, and CoV-NL63 also require a 3CL pro 
in the maturation of viral proteins. 

Several inhibitors have been developed to inhibit the 3C pro of RV 
and EV [9-12] and 3CL pro of SARS-CoV [13-15]. However, their 
inhibitors can not be mutually used without modification. For 
example, AG7088, a potent inhibitor of RV and other picornaviral 
3C pro [16], failed to inhibit SARS-CoV 3CL pro [17]. Unlike the 3CL pro , 
which is dimeric and in which each subunit is composed of three 
domains, the 3C pro is a monomer with only the two catalytic 
domains. The structure-based sequence alignment (Fig. 1) shows 
some sequence differences, which may alter inhibitor specificity. 
In this study, we performed high throughput screening using a li¬ 
brary of —6800 compounds to find five novel inhibitors of the 
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SARS-CoV 3CL pro , 4 of which also inhibited another human CoV- 
229E 3CL pro , but did not inhibit the 3C pro from RV14, CVB3, and 
EV71. But, one compound was found to almost equally inhibit 
these 3CL pro and 3C pro . From computer modeling, we rationalized 
the binding discrepancy of the inhibitors against these proteases. 
The information is useful to further develop more potent individual 
or common inhibitors of 3C pro and 3CL pro of PV and CoV for anti¬ 
viral drug discovery. 

2. Methods 

2A. Expression and purification of the proteases 

Two types of proteases including 3CL pro from SARS-CoV and 
CoV-229E and 3C pro from CVB3, EV71, and RV14 were used to assay 
the inhibitors in this study. The SARS-CoV 3CL pro and EV71 3C pro 
were prepared as reported previously [12,18]. For expressing 
CVB3, RV14, and CoV-229E proteases, the genes were cloned from 
viral cDNAs by using polymerase chain reaction (PCR) as reported 
elsewhere. 


2.2. Primary screening 

For screening, 0.05 pM SARS 3CL pro , 6 pM fluorogenic substrate 
Dabcyl-KTSAVLQSGFRKME-Edans, and 50 pM of approximately 
6800 compounds provided by Korea Chemical Bank (Daejeon, Kor¬ 
ea) were used. Enhanced fluorescence of the reactions in the buffer 
of 20 mM Bis-Tris at pH 7.0 was monitored at 538 nm with excita¬ 
tion at 355 nm using a fluorescence plate reader. The compounds 
which inhibited more than 50% of the protease activity at 50 pM 
were selected for the next assay run at 10 pM. 

2.3. IC 50 determination 

The five hits that inhibited SARS-CoV 3CL pro at 10 pM were also 
evaluated against CoV-229E 3CL pro , EV71 3C pro , CVB3 3C pro , and 
RV14 3C pro . In the assay solution, the activities of these proteases 
(0.5 pM) with 10 pM fluorogenic substrate in the buffers of 
10 mM MES at pH 6.5 and 6.0 (the optimal pH for EV71 and 
RV14 proteases, respectively) and 10 mM HEPES at pH 7.5 (for 
CoV-229E and CVB3 proteases) were measured in the presence 
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Fig. 1. A structure-based sequence alignment of SARS-CoV 3CL pro , CoV-229E 3CL pro , CVB3 3C pro , EV71 3C pro , and RV14 3C pro . The domains according to 3CL pro are shown above 
the sequence and the secondary elements according to the 3C pro structure are shown below it. Arrows indicate the essential catalytic amino acids His and Cys for 3CL pro and 
3C pro , and Glu (only for 3C pro ). 
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of various concentrations of the inhibitors to obtain the IC 50 
values. 

2.4. Computer modeling of the inhibitors binding with the proteases 

For the modeling analysis, we used the crystal structure of SARS 
3CL pro in complex with a peptide inhibitor (PDB code 1UK4) [19], 
the structures of CoV-229E 3CL pro and CVB3 3C pro solved by us, 
and the structural model of EV71 3C pro constructed from the struc¬ 
ture of RV 3C pro (PDB code 1CQQ.) [20]. Docking process was per¬ 
formed using an automated ligand-docking subprogram of the 
Discovery Studio Modeling 1.2 SBD (Accelrys Inc., San Diego, CA), 
with a set of parameters chosen to control the precise operation 
of the genetic algorithm. Docking runs were carried out using stan¬ 
dard default settings “grid resolution” of 5 A, “site opening” of 




12 A, and “binding site” selected for defining the active site 
cavity. 

3. Results 

3.1. Screening of the protease inhibitors 

We first screened against a library of ~6800 compounds for 
inhibiting SARS-CoV 3CL pro . From the primary screening, there 
were 66 compounds which showed more than 50% inhibition of 
the enzyme activity at 50 pM. We further tested their inhibitory 
activities at 10 pM and five of them (21155, 22723, 27548, 
43146, and 48511) showed IC 50 values smaller than 10 pM. 
According to their dose-response curves as shown in Fig. 2A-E, 
the five hits 21155, 22723, 27548, 43146, and 48511 displayed 





Fig. 2. Dose-response curves for the five hits against SARS-CoV 3CL pro from the screening. IC 50 values were determined from the curves using equation 1. These were (A) 
7.2 ±0.7 pM (21155), (B) 10.6 ±1.3 pM (22723), (C) 7.0 ± 0.8 pM (27548), (D) 3.3 ± 0.2 pM (48511), and (E) 8.1 ± 0.9 pM (43146). The structures and activities of these 
inhibitors are summarized in Table 1. 
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Table 1 

Summary of IC 50 values (|iM) of the five hits with SARS-CoV 3CL pro , and other 3C(L) proteases. 



IC 50 values of 7.2 ± 0.7, 10.6 ±1.3, 7.0 ± 0.8, 3.3 ± 0.2, and 
8.1 ± 0.9 jiM, respectively, against the SARS 3CL pro . Similar 
inhibition results were observed for 3CL pro of CoV-229E (data sum¬ 
marized in Table 1), but not for 3C pro . However, 43146 inhibited 
both 3C pro and 3CL pro with IC 50 values of 10.3 ±1.1 pM, 5.4 ± 
0.2 pM, 3.3 ± 0.3, and 5.2 ± 0.6 pM, respectively, against CoV- 
229E 3CL pro , CVB3 3C pro , EV71 3C pro and RV14 3C pro (Fig. 3A-D 
and summarized in Table 1 ). This compound contains a dihydropy- 
razole ring with three substituents, two phenyl groups and a 
lengthy N-butyl-benzimidazolylamino-toluene. 

3.2. Inhibition potencies of the 43146 analogues 

Since 43146 inhibited 3CL pro and 3C pro , its analogues including 
45240, 68638, 55688, and 55585 obtained from another compound 
library were evaluated. As shown in Table 2, all of them showed 
good potencies against the five proteases. The most potent 
compound was 45240, and its IC 50 values in inhibiting the 3C(L) 
proteases were measured to be 2.5 ± 0.2 pM (SARS-CoV 3CL pro ), 
2.6 ± 0.4 pM (CoV-229E 3CL pro ), 1.2±0.3pM (CVB3 3C pro ), 0.5 ± 
0.1 pM (EV71 3C pro ), and 1.7 ± 0.1 pM (RV14 3C pro ) (Table 2). This 
compound contains four rings, three phenyl groups and one imid¬ 
azole, surrounding a central dihydropyrazole ring, without the 
lengthy side chain as seen in 43146. Compound 68638 with ben- 
zylcyclohexane ring fused with the dihydropyrazole ring and acet¬ 
yl and iodobenzyl groups attached to the central ring showed less 
inhibition against both 3C pro and 3CL pro (Table 2). The other two 
compounds, 55688 and 55585, with shorter side chains attached 


to the benzimidazolyl group showed similar inhibitory activities 
as compared to 43146 (Table 2). 

3.3. Computer modeling of21155,22723,27548, and 485II binding to 
the proteases 

These inhibitors are competitive inhibitors with respect to the 
substrate (data not shown), indicating they bind in the active site. 
To rationalize the binding discrepancy of these inhibitors against 
these proteases, their binding modes with SARS-CoV 3CL pro and 
four other proteases were modeled and some of them are shown 
in Fig. 4. The first four inhibitors of SARS-CoV 3CL pro are more rigid 
because the thiazolopyridine in 21155, the dichlorobenzoquinoli- 
none in 22723, the isoindoledione in 27548, and the oxazole in 
48511 adopt planar structures and the three substituents of the 
oxazole ring in 48511 are fixed in a conformer, due to the 1,2-steric 
interaction between the acetate group and the N-aryl imino group 
as well as the biaryl interaction between the phenyl and oxazole to 
prohibit their free rotation. All these compounds can be considered 
as two rigid aromatic moieties connected by a small linker. Based 
on the computer modeling, each of these aromatic moieties is 
bound to SI or S2 site of SARS protease by forming H-bonds and 
hydrophobic interactions (Fig. 4A-D). As shown in the computer 
modeling, Glul66 side chain of SARS 3CL pro forms H-bonds with 
these four inhibitors. However, the corresponding amino acid res¬ 
idue in 3C pro is Glyl64, which lacks the side chain to form H-bond 
with any of these compounds (also see Fig. 4F), leading to loss of 
inhibition. In addition, the 3C pro have more open but shallow S2 
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Fig. 3. Dose-response curves for 43146 against 229E 3CL pro , CVB3 3C pro , EV71 3C pro and RV14 3C pro . IC 50 values were determined from the curves using equation 1. These 
were (A) 10.3 ± 1.1 pM (229E 3CL pro ), (B) 5.4 ± 0.2 pM (CVB3 3C pro ), (C) 3.3 ± 0.3 pM (EV71 3C pro ), and (D) 5.2 ± 0.6 pM (RV14 3C pro ). 


Table 2 

IC 50 values (pM) of compound 43146 analogs with SARS-CoV 3CL pro , and other 3C(L) proteases. 


Compound ID 


Structure 
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Fig. 4. Computer modeling of the binding modes of the inhibitors in the active site of the SARS 3CL pro . The more rigid moieties of the inhibitors 21155, 22723, 27548, 48511 
are probably bound to SI and S2 sites as shown in (A), (B), (C), and (D), respectively. (E) 43146 binds SARS-CoV 3CL pro differently with the biphenyl 4,5-dihydro-lH-pyrazole 
moiety anchored at the SI’ and S2 sites and the rest of the molecule at the S3 and the following sites. The hydrogen bond interactions were represented by yellow dotted lines. 


site (due to its partial blockage by Leul27) than 3CL pro according to 
the crystal structures of RV 3C pro [20] and CVB3 3C pro (Lee et al., 
unpublished results) compared to 3CL pro (also see Fig. 4F). Thus, 
3C pro can not hold these compounds tightly. 


3.4. Binding modes of 43146 and its analogues to the proteases 

In contrast, the compound 43146 is more flexible, because the 
dihydropyrazole is not planar, and the phenyl group is linked to 
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the sp 3 -hybridized carbon of the dihydropyrazole ring, so it is free 
for rotation. Different from the binding modes of the other 4 inhib¬ 
itors, the diphenyl 4,5-dihydro-lH-pyrazole moiety of 43146 fits 
well at the SV and S2 sites in the SARS 3CL pro (Fig. 4E) with the rest 
of the molecule at the S3 site and beyond. With this binding mode, 
the compound was predicted to also bind well in the 3C pro , consis¬ 
tent with the inhibition data. In fact, RV 3C pro prefers a phenyl 
group at the S2 site as evidenced by its strong inhibition by 
AG7088 which has a P2-fluorophenylalanine. Thus, it could be 
rationalized by computer modeling that only 43146 among the five 
hits can inhibit the three 3C pro in addition to the 3CL pro . 

The analogues of 43146, including 45240, 68638, 55688, and 
55585, bind in the 3C pro and 3CL pro active sites with similar modes 
to that of 43146 (data not shown). Compared to 43146, 55688 and 
55585 only have minor structural difference with shorter alkyl 
groups attached to the benzimidazole ring, so that they showed 
similar inhibition against the proteases. The fused ring system 
and the phenyl group in 68638 may also span from SI' to S2 sites 
in both kinds of proteases, yielding similar inhibition. However, 
45240 showed a significantly better inhibition against the 3C pro 
than 43146. Apparently, the lengthy side chain attached to the 
phenyl group in the compound did not provide additional interac¬ 
tion with the protease, consistent with the binding mode shown in 
Fig. 4E. However, the additional interaction is provided by the pyr¬ 
idine ring bound near the more open ST site in 3C pro . 

4. Discussion 

AG7088 is the best inhibitor identified so far for 3C pro , which 
not only inhibits the 3C pro from RV, but also those from CV and 
EV [16]. However, it did not inhibit 3CL pro from SARS-CoV [17]. This 
may be partially due to the blockage of its PI-lactam ring by the 
relatively larger Glul66 side chain and also the S2 site of 3CL pro 
is narrower although it is deeper. Therefore, when the P2-phenyl- 
alanine is changed to non-planar leucine or cyclohexane without 
changing the PI-lactam, they became good inhibitors of SARS 
3CL pro [21]. Unlike AG7088, which is a ketomethyl isostere of a 
tripeptide-conjugated ester, compound 43146 is not peptide¬ 
like. From the random screening as shown in the study, we have 
found a starting point toward the development of non-peptide 
multiple-function inhibitors against CoV and PV. With further 
modification of these individual and common inhibitors of the viral 
proteases, we hope to find solution for the possible reoccurrence of 
SARS and other diseases caused by the viruses with the 3C pro and 
3CL pro . 
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